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Abstract 

Background: Domestic cultivation of medicinal plants is an important strategy for protecting these species from over 
harvesting. Some species of medicinal plants have been brought into cultivation for more than hundreds years. Concerns 
about severe loss of genetic diversity and sustainable cultivation can potentially limit future use of these valuable plants. 
Genetic studies with comprehensive sampling of multiple medicinal species by molecular markers will allow for assessment 
and management of these species. Here we examine the population genetic consequences of cultivation and domestication 
in Scrophularia ningpoensis Hemsl. We used chloroplast DNA and genomic AFLP markers to clarify not only the effects of 
domestication on genetic diversity, but also determine the geographic origins of cultivars and their genetic divergence from 
native populations. These results will allow both better management of cultivated populations, but also provide insights for 
crop improvement. 

/feiu/ts; Twenty-one cpDNA haplotypes of S. ningpoensis were identified. Wild populations contain all haplotypes, whereas 
only three haplotypes were found in cultivated populations with wild populations having twice the haplotype diversity of 
cultivated populations. Genetic differentiation between cultivated populations and wild populations was significant. 
Genomic AFLP markers revealed similar genetic diversity patterns. Furthermore, Structure analysis grouped all wild 
populations into two gene pools; two of which shared the same gene pool with cultivated S. ningpoensis. The result of 
Neighbor-Joining analysis was consistent with the structure analysis. In principal coordinate analysis, three cultivated 
populations from Zhejiang Province grouped together and were separated from other cultivated populations. 

Conclusions: Ihese results suggest that cultivated S. ningpoensis has experienced dramatic loss of genetic diversity under 
anthropogenic influence. We postulate that strong artificial selection for medicinal quality has resulted in genetic 
differentiation between cultivated and wild populations. Furthermore, it appears that wild populations in Jiangxi-Hunan 
area were involved in the origin of cultivated S. ningpoensis. 



Citation: Chen C, Li P, Wang R-H, Schaal BA, Fu C-X (2014) The Population Genetics of Cultivation: Domestication of a Traditional Chinese Medicine, Scrophularia 
ningpoensis Hemsl. (Scrophulariaceae). PLoS ONE 9(8): el05064. doi:10.1371/journal.pone.0105064 

Editor: Xiao-Yong Chen, East China Normal University, China 

Received February 13, 2014; Accepted July 18, 2014; Published August 26, 2014 

Copyright: © 2014 Chen et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits 
unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. 

Funding: This research was supported by the Key Breeding Project of Zhejiang Province (2012C12912), the National Natural Science Foundation of China 
(31070205), and Science and Technology Development Project of Hangzhou West Lake Scenic Area Management Committee (2012002). The funders had no role 
in study design, data collection and analysis, decision to publish, or preparation of the manuscript. 

Competing interests: The authors have declared that no competing interests exist. 

* Email: schaali^biology.wustl.edu (BAS); cxfu@zju.edu.cn (CXF) 



Introduction 

Plant domestication is one of the great milestones in human 
history. The process of domestication represents a continuum of 
increasing codependence between plants and people [1]. Crop 
species have been derived from their wild progenitors as a result of 
artificial selection for desirable traits by early farmers. In the light 
of crop domestication, a number of agencies now are recom- 
mending that wild medicinal plants be brought into cultivation 
systems [2,3]. Currently, more than 80% of the world's population 
in developing countries depends primarily on herbal medicine for 
basic healthcare needs [4,5] . Given the demand for a reliable and 
uniform supply of medicinal plants and the accelerating depletion 
of forest resources, cultivation of medicinal plants species would be 



an important strategy and a viable alternative to harvesting of 
natural populations [6,7]. 

At present some medicinal plants are grown in home farms, and 
some are cultivated as field crops. But the number of medicinal 
plants species currently in high intensity cultivation for commercial 
production are varied world-wide. In India, 20 species are 
currently under cultivation, in Hungary 40 and in Germany 100 
[6,8,9,10]. China, with its history of thousands years of medicinal 
plant use cultivates only about 250 species [11,12]. Considering 
the harvest pressure on wild populations and the increasing 
demands, the most popular medicinal plants are cultivated under 
the supervision of Chinese government on large scale reaching 
almost 1 million acres [13]. 

As the demand for medicinal plant cultivation increased, many 
studies examined the impact of cultivation on genetic diversity. 
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species such Artemisia judaica [14], Scutellaria baicalensis [15], 
Corydalis yanhusuo [16], Coptis chinensis [17], Magnolia 
officinalis subsp. biloha [18], Fritillaria cirrhosa [19]. However, 
all these studies only used single molecular marker or local 
sampling which just revealed genetic diversity patterns in limited 
populations. Since genetic diversity underlies the plasticity of 
secondary metabolism and hence the production of medicinal 
compounds [20], understanding the population genetics across the 
range of wild populations is essential for utilizing native resources 
for improvement of cultivated species. A study with comprehensive 
sampling of both cultivated and wild populations of medicinal 
species, using both chloroplast and nuclear DNA markers, can 
clarify not only changes in genetic diversity, but also provide 
information on domestication including the geographic origins, 
ancestor populations and the impact of domestication on 
population genetics. 

Plant secondary metabolites are responsible for phytochemical 
diversity and therapeutic efficacy of medicinal species. The 
production of secondary compounds is influenced by both 
environmental and genetic factors [20,21]. Thus geographically 
distinct populations even of the same species may have vastly 
dififerent medical qualities [22] . Modern methods of phytochem- 
ical methods are used to discriminate geographically dispersed 
cultivars within a medicinal species [23,24,25,26,27]. Yet litde is 
known about the genetic differences among the wild populations of 
a medicinal species and whether cultivated populations with high 
pharmaceutical quality have already significantiy diverged from 
wild populations. Additionally, understanding genetic changes 
associated with cultivation not only helps control the quality of 
medicinal herbs, but also provides practical information essential 
for formulating appropriate conservation and cultivation manage- 
ment strategies [28,29]. 

Scrophularia ningpoensis Hemsl. is a perennial herb native to 
Southeastern China. Wild populations are distributed in forests 
along streams, thickets, and tall grasses below alt. 1700 m and 
reproduce from seed [30]. Roots of this medicinal herb have a 
long history of widespread use in China to treat inflammation, 
laryngitis, tonsillitis, abscesses and constipation [31,32,33]. The 
first recorded use in ancient Chinese literature is from ca. 100 BC 
[34]. Harpagosidf;, angroside C, acteoside and cinnamic acid are 
the main bioactive components [35,36,37,38,39]. According to the 
Chinese Pharmacopoeia, only the root of S. ningpoensis is listed as 
the Radix Scrophulariae [40]. Our recent field investigations 
indicated that wild populations of this herb have suffered rapid 
declines, and the species is extirpated in many locations due to 
over exploitation and deterioration of habitats. An analysis of the 
population genetics of wild S. ningpoensis populations is important 
and urgent for appropriate utilization, conservation and preser- 
vation of the wild resources. 

Today, S. ningpoensis is cultivated for its roots on a large scale 
in several regions of Central and Southeastern China. This species 
can trace its cultivation history back to Song dynasty (1000 years 
ago) beginning in backyard gardens [41]. In spite of this long 
written history, the geographic origin of cultivation has not been 
recorded. Formal cultivation programs were initiated in China 
only since the 1950's [42]. Unlike wild populations, cultivated S. 
ningpoensis is propagated vegetatively from rhizome. During the 
cultivation, large rhizomes are often selected for propagation; 
flowers are removed before opening so that resources are allocated 
to vegetative growth, in particular the rhizomes [43,44]. Our 
previous study used HPLC revealed that among cultivated S. 
ningpoensis , accessions from Zhejiang Province produced higher 
concentrations of three main bioactive compounds, suggesting that 
the best quality of cultivated S. ningpoensis is from Zhejiang [45] . 



These results are consistent with the general perspective that S. 
ningpoensis from Zhejiang has the best quality for medical use 
[46]. Besides the phytochemical differences, the unresolved 
question of whether cultivated S. ningpoensis from Zhejiang is 
genetically differentiated for others is also important and 
indispensable to sustainable cultivation of S. ninpoensis. 

Here we used two chloroplast DNA fragments and AFLP 
markers to study the population genetics and phylogeography of 
wild and cultivated S. ningpoensis with the aims of: (1) evaluating 
the genetic: erosion of cultivated populations and the change in the 
pattern of genetic diversity under artificial selection; (2) unraveling 
the geographical origins and ancestral populations of cultivated S. 
ningpoensis; (3) clarifying the genetic divergence between culti- 
vated populations from Zhejiang Province and the rest of 
cultivated S. ningpoensis; and (4) determining which wild 
populations are valuable and can be introduced for improvement 
of cultivation. 

Materials and Methods 

Ethics Statement 

Management Bureau of Mt. Jinggang National Nature Reserve 
issued the permit for Jinggang Mountain (JGW); Management 
Bureau of Mt. Tianmu National Nature Reserve issued the permit 
for Tianmu Mountain (TNW, TMIW, TM2W); Management 
Bureau of Mt. Dapan National Nature Reserve issued the permit 
for Dapan Mountain (TWW). No specific permissions were 
required for other locations which are neither privately owned 
nor protected and the field study did not involve endangered or 
protected species. 

Sampling and DNA extraction 

Field studies were conducted throughout distribution range of 
Scrophularia ningpoensis. Specific locations (GPS coordinates) of 
our field work are indicated in Table 1 . A total of 364 individuals 
representing 13 cultivated and 14 wild populations were sampled 
with nine to twenty-two individuals coUected randomly for each 
population (Table 1, Figure 1). Leaf material was dried in silica gel 
and stored at 4°C. Voucher specimens are deposited in the 
Herbarium of Zhejiang University (HZU), China. Total genomic 
DNA was extracted using a modified (CTAB) method [47,48]. 

Chloroplast (cp) DNA sequences analysis 

After preliminary screening for variation in the intergenic 
spacer (ICS) regions of chloroplast DNA, two regions: trnh-trnF 
[49] and pshA-trnH [50] were chosen for analysis because they 
contained sufficient levels of informative polymorphic sites. PCR 
amplification was performed in an ABI 9700 thermocycler 
(Applied Biosystems). Cycling conditions were 95°C for 2 min 
followed by 35 cycles of 94°C (45 s), 57°C (45 s), 72°C (1 min), 
and concluding extension at 72°C (10 min). The reaction mixture 
(25 nL) contained 2.5 mm/L MgClj, 0.5 ixm/L dNTP, 2.5 jxl 
lOxbuffer, 2.5 |im/L primer, 1 U Taq DNA polymerase [Takara 
Bio Inc.) and 20 ng DNA template. PCR products were identified 
on 1.5% agarose gels in a O.SxTBE buffer, stained with ethidium 
bromide (0.5 |a,g/ml), visualized with ultraviolet light and photo- 
graphed. DNA Marker DL2000 (Takara Bio Inc.) was used. The 
PCR products with a single band were cleaned by a GENE- 
CLEAN II Kit (BIO 101, Inc. Carisbad, USA) for direct 
sequencing. Samples were sequenced in forward and reverse 
directions using the dideoxy chain termination method with 
fluorescent labeling with Big Dye Terminator (version 1.1). 
Sequences were determined with an ABI 377XL DNA sequencer 
and edited using Sequencher version 4.0 (Gene Codes Corp., Ann 
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Table 1. Details of sample locations and sample sizes (N) of Scrophularia ningpoensis. 





Population code 


Locality 


Altitude (m 


Lat. (°N) Long. 
(°E) 


N CpDNA 


N AFLP 


Vouchers 


CULTIVATED 


GZC 


Guangze, Fujian Province 


234 


117°20', 27°32' 


15 


15 


LP0805GZ 


JFC 


Jinfo Mountain, Chongqin 
Province 


1282 


107°12', 29°00' 


14 


13 


FU0810321 


sec 


Guangyuan, Sichuan 
Province 


594 


105°47', 32°27' 


14 


19 


GW20070705 


XJC 


Xianju, Zhejiang Province 


63 


120=44', 28°51' 


10 


10 


CC20060601 


•RCC/PAC 


Pan'an, Zhejiang Province 


356 


120°24', 28°54' 


15 


4 


CC20060602 


•YCC/PAC 


Pan'an, Zhejiang Province 


357 


120°33', 29°05' 


11 


4 


CC20060603 


•SHC/PAC 


Pan'an, Zhejiang Province 


537 


120=39', 29 08' 


10 


4 


CC20060604 


♦DPC/PAC 


Pan'an, Zhejiang Province 


581 


120=32', 28°59' 


14 


4 


CC20060605 


YLC 


Yangling, Shanxi Province 


459 


108=04', 34 15' 


15 


10 


LP2008081 1 


ZPC 


Zhenping, Shanxi Province 


1013 


109=33', 31=45' 


15 


10 


LP20090610 


KKC 


Suiyan, Guizhou Province 


1572 


107=09', 28=13' 


10 


8 


WY20090618 


LOC 


Long Mountain, Hunan 
Province 


843 


109=43', 29=36' 


15 


10 


ST20090601 


NSC 


Enshi, Hubei Province 


1606 


110=12', 30=44' 


13 


10 


ZZ20080721 


WILD 


HNW 


Pingjiang, Hunan Province 


1428 


113=49', 28=38' 


15 


15 


SS20070821 


JGW 


Jinggang Mountain, 
Jiangxi Province 


975 


114=07', 26=35' 


15 


15 


CC0708J029 


JHW 


Jiuhua Mountain, Anhui 
Province 


1246 


117=49', 30=28' 


22 


22 


CC20080507 


JMW 


Junma, Henan Province 


673 


111=25', 33=28' 


13 


13 


SS20070703 


LAW 


Tiantang zhai, Anhui 
Province 


492 


115=47', 31=11' 


15 


14 


LP0906508 


LSW 


Lenshui, Jiangxi Province 


80 


117=11', 28=15' 


15 


15 


SS20070701 


MTW 


Matou, Jiangxi Province 


101 


115=53', 29=27' 


15 


15 


SS20070702 


PTW 


Putuo, Zhejiang Province 


17 


122=23', 30=00' 


9 


9 


CC20080527 


THW 


Tiantai Mountain, Zhejiang 
Province 


960 


121=05', 29=14' 


15 


14 


LP0910919 


TNW 


Tianmu Mountain, 
Zhejiang Province 


1477 


119=25', 30=20' 


15 


14 


LP0906TMS 


TTW 


Tiantong, Ningbo, 
Zhejiang 


348 


121=47', 29=48' 


10 


10 


CC20071001 


TMIW 


Tianmu Mountain, 
Zhejiang Province 


315 


119=26', 30=19' 


10 


10 


CC20061001 


TM2W 


Tianmu Mountain, 
Zhejiang Province 


266 


119=29', 30=19' 


15 


10 


CC20070706 


TWW 


Dapan Mountain, Zhejiang 
University 


469 


120=23', 28=54' 


9 


9 


CC20060901 


•In AFLP analysis, four 


populations RCC, YCC, SHC and DPC are combined as one population coded PAC, anc 


in Figure 1C these four populations are also indicated as 



PAC. 

doi:1 0.1 371 /journal.pone.Ol 05064.t001 



Arbor, MI, USA). Sequences of the two IGS regions were 
combined and aligned by Clustal W version 1.8 [51]. 

Sequences from all individuals were characterized for their 
cpDNA haplotypes. Chloroplast DNA haplotype diversity (h) and 
nucleotide diversity (n) [52] were calculated for each population 
(As, Tts) and overall {h-Y, Ti-p) using DNASP 5.10 [53]. A haplotype 
network rooted by Scrophularia huergeriana that contains all 
linkages with >95% probability of being most parsimonious was 
drawn by TCS version 1.21 [54]. In this analysis, indels and one 
inversion were treated as single mutation events. Phylogenetic 



relationships among cpDNA haplotypes of S. nignpoensis were 
assessed by Bayesian inference (BI), maximum likelihood (ML), 
and maximum parsimony (MP) methods with S. huergeriana, S. 
spicala and S. dentate as the outgroup. For Bayesian inference, the 
substitution model was determined by MrModeltest 2.2 [55] and 
the best-fit model (F81+I+G) was selected by Akaike information 
criterion (AIC). Then Bayesian analysis was performed by 
MrBayes 3.1.2 [56]. Two independent runs of four Markov 
chains each starting with a random tree were processed in ten 
threads simultaneously for 50 million generations, sampling trees 
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S. dentata 



A B C 

Figure 1. Analyses of 21 chloroplast (cp) DNA haplotypes of Scrophularia ningpoensis. (A) The maximum likelihood tree of 21 chloroplast 
(cp) DNA haplotypes of S. ningpoensis, S. buergeriana, S. spicata and S. dentata were used as the outgroup. Statistical supports (Bayesian posterior 
probability >80/maximum likelihood bootstrap value >50/maximum parsimony bootstrap value >50) are indicated on the branches. (B) 95% 
plausible network of the 21 cpDNA haplotypes of S. ningpoensis (A-E1 1). The size of circles corresponds to the frequency of each haplotype. Each 
solid line represents one mutational step that interconnects two haplotypes for which parsimony is supported at the 95% level. The small open circles 
indicate inferred intermediate haplotypes not detected in this investigation. (C) A geographic distribution of 21 cpDNA haplotypes (A-E1 1) detected 
in Scroptiuiaria ningpoensis. The populations correspond to those detailed in Table 1. Squares represent cultivated populations and circles represent 
wild populations. 

doi:1 0.1 371 /journal.pone.01 05064.g001 



at every 1000 generation. The first 12500 sampled trees (25%) 
were discarded as burn-in samples. Maximum likelihood [57] tree 
and ML bootstrap searches were conducted using RAxML V.7.2.8 
[58] on the CIPRES [59]. Final tree was evaluated and optimized 
under GAMMAGAMMA Model. ML bootstrap values were 
estimated fi-om 1000 bootstrap replicates. Maximum parsimony 
analyses were conducted using PAUP* version 4.0b 10 [60]. 
Support values for the relationships were calculated by performing 
bootstrap analyses of 1000 heuristic search replicates using the 
TBR branching swapping algorithm with 100 random additions 
per replicate. The geographical distribution of haplotypes was 
plotted on a map of China using Mapinfo (Pitney Bowes, Inc.). To 
test for significant difference in haplotype fi"equencies between 
wild and cultivated populations, total diversity (Ax), within- 
population diversity (Ag), U test (Nst/Gst) and population 
differentiation (Gst) were calculated in HAPLONST [61]. 
Hierarchical structure of genetic variation was estimated by an 
analysis of molecular variance (AMOVA) on three levels: among 
groups (cultivated and wild groups); among-population within 
groups; and within-population. All AMOVAs were performed 
with ARLEQUIN [62]. Sequence variation was tested for 
deviations from neutrality by Tajima's D statistic [63], and by 
Fu and Li's D* and F* statistics [64] using DNASP 5.10 [65]. 

AFLP analysis 

AFLP method followed the protocol established by Vos [66] 
with minor modifications. First, selective primer pairs were pre- 
screened for three individuals. Second, thirty-six primer combi- 
nations which had produced well-separated fragments in the pre- 
screen were used on eight individuals from different populations to 



determine polymorphisms. Six primer pairs which yielded 
polymorphic, clear and reproducible fragments were selected for 
the final AFLP analysis: £coRLAAC (FAM)/M5«TCTC, EcoRl- 
ACA (FAM)/MseI-CTG, £coRI-ACC (HEX)/Ms«I-CAC, 
£coRI-ACC (HEX)/M.?eTCTA, EcoRl-AGG (TAMAR)/M^eL 
CAG and £coRI-AGG (TAMAR)/M,s«I-CTC. FAM, HEX and 
TAMAR are three differential fluorescence labels for multiplexed 
electrophoresis. 

The protocols are as follows: (i) in the digestion step, 2.5 ul 
(0.25-0.3 ug) of genomic DNA was restricted in 2.5 ul of a 
reaction mixture that contained 0.2 (il Eco R I (15 U/|iL), 0.15 ul 
Mse I (10U/^lL), 0.5 ^ll lOxBufffer and 1.65 ul ddHjO. The 
mixture was kept at 37°C for 3 hours, then 65°C for 10 minutes; 
(ii) the ligation reaction included 5.0 (J.1 digested genomic DNA, 
1 .0 ^1 1 0 X T4 DNA ligase bufier, 1 .0 ^1 Eco R Ladapters (5 |iM), 
1.0 ^1 Mse I adapters (50 ^M), 0.1 ul T4 DNA ligase (3 U/|iL) 
and 1.9 |J,1 ddH20. The ligation reaction was 16°C for 12 h and 
the product was diluted with 90 ^il of ddH20; (iii) pre-selective 
amplifications were carried out in 12.5 |jl reactions: 2.5 |il diluted 
ligation product, 1.25 |xl lOxPCR buffer, 1 ^1 MgClj (25 mmol/ 
L), 2 ul dNTPs (2.5 mmol/L), 0.5 ^il Eco R I - A primer (5 umol/ 
L), 0.5 |xl Mse I - C primer (5 umol/L), 0.05 |il Taq DNA 
Polymerase (5 U/|j,L), 4.7 |J.l ddH20. PGR conditions for pre- 
selective amplification were 72°C for 2 min; 94°G for 20 sec, 56°C 
for 30 sec, 72°C for 2 min, for 20 cycles, 60°C for 30 sec. The 
product of the pre-selective PGR was diluted with 150 |j.l ddHjO; 
(iv) selective amplification reactions included 3 [il diluted pre- 
selective amplification products, 1.25 |J,1 lOxPCR buffer, 1.25 |ll 
MgCl2 (25 mmol/L), 2 ^1 dNTPs (2.5 mmol/L), 0.8 \il Eco R I 
primer (0.5 U/^L), 0.5 ^1 M primer (5 U/^L), 0.5 ul BSA 
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(20mg/mL), 0.1 III Taq DNA Polymerase (5 U/nL), 3.6 nl 
ddHgO. PGR conductions were 94°C for 2 min; 13 cycles of 
94°C for 30 sec, 65°C-56°C (-0.7°C/cycles) for 1 min, 72°C for 
1.5 min; 23 cycles of 94''C for 30 sec, 56°C for 1 min, 72°C for 
1.5 min; 72°C for 8 min. The product of the selective amplifica- 
tion was analyzed on ABI3730 (Applied Biosystems) according to 
the manufacture instructions with GS ROX-labeUed 500 size 
standard (Applied Biosystems). For each primer combination, 
negative amplification controls without template DNA resulted in 
no amplification products. 

Raw AFLP data were scored and collected as presence/ absence 
matrix firstly by Genemarker vl.7 (SoftGenetics LLC) and then 
each locus was manually inspected. Only unambiguously detect- 
able fragments in the size range 75-450 bp were scored. To test 
the reproducibility of the AFLP data, the complete AFLP 
procedure (digestion, ligation, pre-selective, selective amplification) 
was repeated for 15 individuals (5% of the samples) and scored 
independentiy for each primer pair. 

Assuming a set of measures of wild and cultivated populations, 
population genetic statistics were generated by the program 
POPGENE version 1.31 [68] including: the number of polymor- 
phic fragments (Fp), the percentage of polymorphic fragments 
{PPF), Nei's [67] gene diversity (h), Shannon's diversity index [H), 
coefficient of gene differentiation (Ggx - total genetic diversity - 
gene diversity within population/total genetic diversity), and the 
level of gene flow: = 0.5(1 —Gst)/Gst- To explore the 
relationships among individuals Principal Coordinate Analysis 
(PCoA) was performed by software MVSP version 3.1 (Kovach 
Computing Services, Anglesey, Wales) from pairwise Euclidian 
distances between individuals genotypes. A neighbor-joining (NJ) 
tree rooted by the populations of Scrophularia huergeriana was 
constructed by the PHYLIP 3.62 package [69] and group support 
was assessed by a bootstrap analysis with 1000 rephcates. Genetic 
admixture analysis was implemented in STRUCTURE version 
2.2.3 [70] with "no admixture model" and assumed "uncorrelated 
allele frequencies" [71]. The main modeling assumptions of 
STRUCTURE are Hardy-Weinberg equilibrium within popula- 
tions [70]. But the cultivated populations are Hardy-Weinberg 
disequilibrium because of extensively clonal reproduction by 
rhizome within population. So instead of using all cultivated 
populations, only one individual from each different cultivated 
population (total ten individuals) were used for STRUCTURE 
analysis. The number of clusters (K) was set from 2 to 14 (number 
of all wild populations) with 10 replicate runs for each K, a burn-in 
period of 100000, and additional 1000000 rephcates of the 
MCMC chains after burn-in. The structure computations output 
files were carried out by the freely accessible- STRUCTURE 
HARVESTER [72] to calculate similarity coefficients between the 
replicate runs and to plot the mean logarithmic likelihood of K 
values (the means of the estimated log posterior probability of the 
data over the replicate runs for each K value). All populations were 
divided into two groups (cultivated and wild) and analysis of 
molecular variance (AMOVA) [73] was conducted by ARLE- 
QUIN [62] to quantify genetic difierentiation at difierent 
hierarchical levels. "Frequency- down-weighted marker values" 
(DW) [74] quantified by the amount of particular AFLP markers 
in the total dataset were calculated by AFLPdat [75]. 

Results 

CpDNA sequence data 

Total 21 different cpDNA haplotypes (A-U) were identified 
among the 364 individuals from 27 populations of S. ningpoensis. 
A total of 26 polymorphic sites were detected across the two 



chloroplast regions (Table 2); 21 were single-site mutations, four 
were length polymorphisms (4 bp, 9 bp, 7 bp and 2 bp, respec- 
tively) and one was an inversion (22 bp). Considering that 
sequencing poly-N regions could easily cause homoplasies due to 
polymerase error, length variations in mononucleotide repeats 
(poly A or T stretches) were not treated as polymorphisms. The 
trnii-psbA region (21 polymorphic sites detected in 466 aligned 
positions; 4.5%) was more variable than the triiL-tmF region (5 
polymorphic sites detected in 811 aligned positions; 0.62%). The 
combined cpDNA sequences of S. ningpoensis varied in length 
from 1255 to 1268 bp, with a consensus length of 1277 bp after 
Eilignment. When the two fragments were combined, 2.0% of all 
sites (26/1277) were polymorphic. Both Tajima's test (pshArtmYb 
D = -0.42908, P>0.10; tmL-tmF: D = 0.03414, P>0.10) and Fu 
& Li's test IpsbA-trnH: B=i= = 1.26305, P>0.10; F* = 0.68518, P> 
0.10; IrnL-lrnF: D*= 1.01550, P>0.10; P* = 0.80528, P>0.10) 
indicated no significant deviation from neutraht}' for the two 
regions. The combined sequences of the spacers conformed to the 
expectation of neutrality as well (Z) = -0.33903, P>0.10; 
Z)*= 1.51081, 0.10>P>0.05; Pi= = 0.87503, P>0.10). Sequences 
of the eighteen pshA-trnU and six trnL,-trnF haplotypes are 
available in GenBank database (GenBank accession num- 
bers:KJ 1 94 1 43-KJ 1 94 1 66). 

Haplotype diversity and within-population diversity declined 
under anthropogenic influence during the course of cultivation 
(Table 3). Wild populations contained all haplotypes, whereas 
cultivated populations only had three haplotypes (A, B and E). 
Moreover, in wild .S'. ningpoensis, eleven out of fourteen 
populations {TS.6%) were polymorphic for haplotypes. In contrast, 
every cultivated population was fixed for a single haplotype. Wild 
populations had much higher estimates of haplotype diversity 
{hj- — 0.919), the within-population component of diversity 
(As = 0.444) and nucleotide diversity (ht = 0.00301) than do 
cultivated populations (/i j = 0.399 /j.s = 0, tit = 0.00076). In wild 
S. ningpoensis, haplotype diversity varied: populations LSW, 
TM2W had the highest haplotype diversity whereas populations 
TWW, PTW, THW from Zhejiang were each fixed for a single 
haplotype (Figure IC). An AMOVA indicated that 41.02% of die 
total genetic variation occurred between cultivated and wild 
groups (Table 4), which is consistent with the high observed 
genetic differentiation (F57- = 0.71988) between cultivated and wild 
groups. And in cultivated 5. ningpoensis all genetic variation was 
among populations, but in the wild populations, more variation 
occurred within populations (58.54%) than among wild popula- 
tions (41.46%). 

Maximum parsimony analysis of these 2 1 haplotypes, with S. 
huergeriana, S. spicata and S. dentata as the outgroup, resulted in 
501 most parsimonious trees with a length of 87, a consistency 
index (CI) of 0.828, and a retention index (RI) of 0.583. Maximum 
likelihood optimization resulted in a final optimization likelihood 
of -2020.22, with the alpha parameter being 0.020000. The best- 
scoring ML tree had a length of 0.073087. The Maximum 
likelihood tree with statistical support indicated above branches 
was presented in Figure lA, haplotypes of A, B, C, CI, C2, C3, D, 
Dl and D2 were monophyletic but with very low bootstrap 
support. The MP/BI/ML analyses yield mostiy congruent 
topologies. Haplotypes (A, B, E) which cultivated populations 
harbored occur in different clades. The cpDNA haplotype 
network for the 27 populations of S. ningpoensisis is shown in 
Figure IB. The rooted TCS network (Figure IB) showed a similar 
structure to the phylogenetic tree (Figure lA), which grouped 21 
haplotypes into two distinct clades. The outgroup S. huergeriana is 
linked with the center of the whole network by two missing 
haplotypes. Two most common haplotypes A and B are on tip 



PLOS ONE I www.plosone.org 



5 



August 2014 I Volume 9 | Issue 8 | e105064 



Domestication of Scrophularia ningpoensis 



* 01 O 



IN O O 



<<<<<<<< 



U . U 13 u 



<<<<<<<<<<< 



<<<<<<<<<<<<<<<<<< 



3 X 



Q. i/i 

>. QJ 

o -g 



E K 



PLOS ONE I www.plosone.org 



6 



August 2014 | Volume 9 | Issue 8 | el 05064 



Domestication of Scrophularia ningpoensis 



Table 3. Comparisons of genetic diversity and genetic structure between wild and cultivated Scrophularia ningpoensis populations 
based on chloroplast DNA sequences. 





Parameter 


wild 


cultivated 


Number of haplotype 


21 


3 


haplotype diversity {hj) 


0.919 


0.399 


Within-population diversity(/is} 


0.444 


0 


Nucleotide diversityCTij), 


0.00301 


0.00076 



doi:l 0.1 371 /journal.pone.Ol 05064.t0a3 



position linked with haplotype C and D to form one clade, while 
the majority of rare haplotypes forms a "star" shape dominated by 
haplotype E. The geographic distribution and haplotype frequen- 
cies within populations are shown in Figure IC. The most 
frequent haplotypes (A, B and E) were shared by both wild and 
cultivated groups, while other remaining haplotypes were found 
only in wild populations (Figure IC). The most common and 
widest geographically distributed haplotype in cultivated popula- 
tions was A which was unique to the wild population LSW located 
in Lenshui County in northeastern Jiangxi Province. In wild 
populations, there was no clear geographical pattern of within- 
population diversity (Figure IC). Moreover, U test (Nst/Gst) 
indicated no phylogeographic structure in wild S. ningpoensis 
(NsT = 0.446, GsT = 0.534, U=-1.57, P>0.05). 

AFLP data 

After exclusion of ambiguous or irreproducible fragments, 289 
fragments were consistently scored for 24 populations (306 
individuals) of which 261 fragments (90.31%) were polymorphic. 
The test for reproducibility yielded a mean error rate of 1 . 7 % and 
confirmed the reproducibility of the AFLP fragments. Wild 
populations had much higher estimates of genetic diversity 
(A = 0.0791-0. 1614) than did cultivated populations (/i = 0.0076- 
0.0875) (Table 5). In wild S. ningpoensis, the percentage of 
polymorphic fragments varied from 22.15% in population TWW 
to 50.87% in population LAW; in cultivated S. ningpoensis, the 
percentage of polymorphic fragments varied from 3.11% in 
population SCC to 27.68% in population NSC (Table 5). 

The variation between cultivated and wild groups accounted for 
19.62% of the overall genetic variation (Table 6). In the cultivated 



group, most variation (76.97%) existed in among-population with 
only 23.03% due to the within-population (Table 6). But the 
variation distribution in the wild group was opposite that most 
variation (60.40%) existed in within-population with only 39.60% 
due to the among-population (Table 6). 

Principal coordinate analysis also showed a clear differentiation 
between cultivated and wild S. ningpoensis. All populations were 
grouped into three main clusters (Figure 2): cluster I included most 
wild populations; cluster II included most cultivated populations 
and two wild populations LSW & HNW; and cluster III included 
three cultivated populations: PAC, XJC and GZC. Populations 
PAC and JiJC were from Zhejiang province. While GZC is in 
Fujian Province, local farmers indicated that it was direcdy 
introduced from Zhejiang cultivated populations. Genetic admix- 
ture analysis performed by STRUCTURE divided genetic 
variation into two gene pools (k = 2, Figwe 3B): (1) Cultivated 
individuals, wild population HNW and part of wild population 
LSW; (2) Other wild populations. In STUCTURE analysis K 
values ranged from 2 to 1 4 and the one with the highest likelihood 
(k = 2) was chosen (Figure 4). 

The Neighbor-Joining analysis identified two wild populations 
(HNW & LSW) that had a close relationship with cultivated S. 
ningpoensis. The wild population HNW was sister to cultivated 
populations with a bootstrap value of 84%; LSW clustered with 
HNW and all cultivated populations supported by 100% bootstrap 
value (Figure 3A). Although three cultivated populations, PAC, 
JCJC and GZC, grouped together and are distinct from other S. 
ningpoensis in PCoA analysis, in the NJ tree they located in one 
clade with hundred percent support and were associated with 
other cultivated populations. 



Table 4. Hierarchical analysis of molecular variance for 28 populations of Scropliularia ningpoensis based on chloroplast DNA 
sequences. 





Regional grouping 
of populations 


Source of variation 


d.f. 


Sum of squares 


Variance components 


Percentage of 
variation 


F-statistics(P) 


Cultivated 


Among populations 


12 


79.731 


0.50613 


100 


FsT=l(<0.001) 




Within populations 


158 


0 


0 


0 




Wild 


Among populations 


13 


156.171 


0.79368 


41.46 


FsT = 0.41462(<0.001) 




Within populations 


179 


200.576 


1.12053 


58.54 




Cultivated vs. wild 


Among groups 


1 


167.858 


0.87163 


41.02 


Fcr = 0.41023(<0.001) 




Among populations 
within groups 


25 


235.902 


0.65791 


30.96 


fsc = 0.52503(<0.001) 




Within populations 


337 


200.576 


0.59518 


28.01 


FsT = 0.71988(<0.001) 



FsY- genetic differences among populations; Fcj. genetic differences among groups defined a priori; F^c'- genetic differences among population within groups. 
doi:l 0.1 371/journal.pone.Ol 05064.t004 
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Table 6. Hierarchical analysis of molecular variance for 24 populations of S. ningpoensis based on AFLP. 





Regional grouping of 
populations 


Source of variation 


d.f. 


Sum of squares 


Variance components 


Percentage of 
variation 


/^statistics 


Cuitivstsd 


AmoriQ populations 


g 


2251 916 


1 R 7'5Ai^n 
1 o./ ZODU 


76 97 


r^j — U./ oy/ 




Within populations 


111 


666.615 


5.60181 


23.03 




Wild 


Among populations 


13 


2451.695 


12.85266 


39.60 


FsT = 0.39605 




Within populations 


171 


3351.516 


19.59951 


60.40 




Cultivated vs. wild 


Among groups 


1 


1299.331 


7.09433 


19.62 


FcT = 0.19616 




Among populations 
within groups 


22 


4703.610 


15.21535 


42.07 


Fsc = 0.52339 




Within populations 


282 


4018.131 


13.85563 


38.31 


FsT = 0.61 688 



Fst: genetic differences among populations; FcY- genetic differences among groups defined a priori; Fsc'- genetic differences among population within groups. 
doi:l 0.1 371 /journal.pone.Ol 05064.1006 



Discussion 

Change of genetic diversity under domestication 

For many crop species the domestication process has left a 
signature in the pattern of genetic diversity as a result of founder 
effects, genetic bottlenecks and artificial selection [76]. Many crops 
show evidence of genome-wide reductions in variation [77,78,79]. 
In our study, the pattern of genetic diversity between cultivated 
and wild populations of Scrophularia ningpoensis is consistent with 
other domesticated crop species, even though the domestication 
history of 5. ningpoensis is only one thousand years. Out of 
twenty-one cpDNA haplotypes in the species, only three haplo- 
types were detected in cultivated populations. Moreover, cultivat- 
ed populations had little within-population genetic polymorphisms 
(Table 3). The cpDNA nucleotide diversity (tit) and haplotype 
diversity (At) of cxiltivated populations were much less than that of 
the wild group (Table 3). AFLP data also support these patterns 



(Table 5). These results are consistence with many crop-relative 
systems of domestication [80,81] although the degree of decline in 
diversity of S. ningpoensis with such a short history of 
domestication is surprising. This medicinal plant has lost genetic 
diversity as a result of domestication which clearly shows that a 
species can have large genetic alterations even before there are 
significant changes in morphology, phenology or breeding system. 
Other species native to China, such as Corydalis yanhusuo 
(traditional Chinese medicine) [16], Metasequoia glyptostroboides 
[82], Mains pumila [83] and Zizania latifolia [84] also revealed 
the same genetic pattern. 

These large changes in genetic diversity are most likely the result 
of both founder effects and selection for medicinal quality, and 
importandy, the clonal mode of reproduction. The extent of the 
loss of genetic diversity depends in part on population size during 
domestication and time over which domestication occurred [77]. 
At the beginning of cultivation, S. ningpoensis was gathered from 
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Figure 2. Principal coordinate analysis (PCoA) of 306 individuals from 24 populations of 5. ningpoensis based on the Euclidean 
distance generated from AFLP data. To simplify comparison, cultivated S. ningpoensis are indicated by triangles, while wild S. ningpoensis are 
indicated by circles. 
doi:1 0.1 371/journal.pone.01 05064.g002 
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Figure 3. Neighbor-Joining and STRUCTURE analyses of the AFLP data. (A) Neighbor-Joining analysis of the AFLP data for all individuals of 
S.ningpoensis based on Nei's (1979) genetic distances with S. buergeriana as the outgroup. (B) Result of the model from clustering (K=2) of all wild 
individuals and 10 cultivated individuals of S. ningpoensis using STRUCTURE based on the AFLP data set. 
doi:10.1371/journal.pone.0105064.g003 



the wild. Most likely a limited number of individuals from the wild 
formed the founding populations of cultivated S. ningpoensis, the 
founders containing only a sample of the genetic diversity of wild 
populations. Although the domestication history of S. ningpoensis 
is short, under cultivation it is propagated vegetatively from 
rhizome and flowers are removed. These practices restrict gene 
flow between wild and cultivated S. ningpoensis populations and 
among individuals within cultivated populations. Moreover, 
farmers select large and healthy rhizomes to establish new 
populations. Such selection coupled with clonally propagation 
could quickly fix alleles carried by selected individuals. The 
increased Fst values (Table 4, Table 6) for cultivated populations 
(cpDNA: fsT= 1; AFLP; /'st = 0.76974) relative to wild popula- 
tions (cpDNA: /'sT^ 0.41462; AFLP: fsT = 0.39605) reflect the 
diflerent modes of reproduction in cultivated (clonally propagated) 
and wild (sexually propagated) populations. 



Origin of cultivated Scrophularia ningpoer)sis 

Locating the geographical site of domestication and assessing 
the demographic consequences of wHd-domesticated species 
evolution can help to design strategies of use and management 
of genetic resources [85]. In the current study, three cpDNA 
haplotypes (A, B and E) found within cultivated populations are 
scattered on the haplotype network and shared with different wild 
population gene pools. But the most cultivated populations 
harbored haplotype A, which is only found in the wild population 
LSW. Two other haplotypes B & E, distributed in two cultivated 
populations (YLC and SCC) and one cultivated population (NSC) 
respectively, are shared with three and four different wild 
populations respectively. It is interesting that wUd populations 
harboring haplotype B or E didn't have haplotype A. This cpDNA 
pattern indicates that the wild population LSW in Jiangxi Province 
contributed substantially to the origin of cultivated S. ningpoensis, 
while other six wild populations JGW, LAW, JHW, TNW, 
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Figure 4. The estimated mean logaritKimic liltelihood of K values ranging from 2 to 1 4 with 1 0 replicates for each K calculated using 
the R-script Structure-sum. 

doi:10.1371/journal.pone.0105064.g004 



TM2W, PTW) located in geographically separated places were 
only involved in origin of three cultivated populations. Moreover, 
the wild population JGW harboring both haplotype B and E is also 
in Jiangxi Province as the wild population LSW. Thus, the Jiangxi 
region is a critical site for the origin of cultivated S. ningpoensis . 

However, results of AFLP analysis suggested that the origin of 
cultivated populations is more restricted to only two wild 
populations, HNW and LSW. In the neighbor-Joining analysis 
(Figure 3), HNW and LSW clustered with other cultivated 
populations by 100% bootstrap value and the STRUCTURE 
analysis was consistent, placing the two wild populations, HNW 
and LSW, into the same gene pool as the cultivated individuals. In 
principal coordinate analysis (Figure 2), wild population HNW 
and LSW grouped with majority of cultivated populations in one 
cluster. Thus, combined with cpDNA data, we infer that wild 
populations of the Jiangxi region may have played a major role in 
the origin of cultivated S. ningpoensis . Although three cultivated 
populations, PAC, XJC and GZC, grouped to another cluster in 
PCoA analysis (Figure 2), they stiU located in the cultivated clade 
identified in the Neighbor-Joining analysis (Figure 3A) and were 
assigned to the same gene pool as the other cultivated S. 



ningpoensis (Figure 3B). The wild population LSW located in 
eastern Jiangxi and HNW population is on the border between 
Hunan and Jiangxi, so AFLP data strongly supported that Jiangxi 
area of Eastern China is the geographic origin of cultivated S. 
ningpoensis. 

Considering the discordance between chloroplast and nuclear 
markers,the origin of cultivated S. ningpoensis must be interpreted 
with caution. Firstly, patterns of cpDNA indicated broader 
geographic origin of cultivated S. ningpoensis compared with 
AFLP markers, which can be explained by the characteristics of 
chloroplast genome evolution. CpDNA evolves slowly and is 
vmiparentaUy inherited in plants, so it usually only represents the 
maternal genome and often has low levels of variation making 
cpDNA less informative for intraspecific studies [86]. Nuclear 
genomic AFLP markers provided much higher resolution both 
because of rapid evolution and many independent markers, thus 
providing a more nuanced interpretation of the origin of cultivated 
S. ningpoensis. Revealed by AFLP markers, cultivated populations 
originated from Jiangxi area where wild populations were involved 
in the event of origin. Secondly, if AFLP data has revealed the 
more resolution, these two wild populations which were involved 
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in the origin of cultivation, HI^V & LSW, should harbor all 
cpDNA haplotypes (A, B and E) found in cultivated populations. 
But the wild population HNW didn't share any cpDNA haplotvpe 
with cultivated populations; only the native population LSW 
harbored haplotype A. These data suggest that this entire region of 
China may have served as the source of medicinal S. ningpoensis 
rather than a specific population. The ancestors of cultivated 
populations YLC, SCC and NSC were probably introduced from 
wild population similar to the extant HNW; harvesting and 
cultivation may, by chance, collected all individuals which had the 
same cpDNA haplotypes with cultivated populations. In addition, 
the nearest wild population to HNW is JGW, which shared two 
cpDNA haplotypes, B&E, with cultivated populations. But the 
very high value of Dw in HNW (21.92) and JGW (24.79) suggests a 
long-term history of isolation [74]. The long term use, and most 
likely, trading of rhizomes of S. ningpoensis lead to a complicated 
domestication history for this species. 

Crop improvement and sustainable cultivation of S. 

ningpoensis 

The therapeutic efficacy of a medicinal plant often varies by 
geographical location among the populations of a species [22]. 
Cultivated S. ningpoensis from Zhejiang Province is recognized to 
have the best medicinal quality [87]. Our previous study on HPLC 
fingerprints of S. ningpoensis has also revealed that materials 
cultivated in Zhejiang Province produce the highest content of 
bioactive compounds compared to other cultivated populations 
[45]. In this study, analysis of AFLP markers group cultivated 
populations from Zhejiang into one cluster which is divergent from 
other cultivated populations and other wild populations (Figure 2). 
Moreover, cultivated populations from Zhejiang gather in one 
clade associated with other cultivated populations in NJ tree 
(Figure 3 A). This evidence strongly supports the genetic distinct 
nature of cultivated S. ningpoensis from Zhejiang (Figure 2). 
Under strong artificial selection, cultivated S. ningpoensis with 
high pharmaceutical quality has genetically diverged from wild 
populations. It is generally believed that different medical effects 
due to secondary compounds are caused by various environmental 
conditions and habitats where the medicinal plants are grown and 
harvested [B8]. But our study (■()ml)incd with previous phyto- 
chemical analysis indicates that under artificial selection, cultivated 
S. ningpoensis from Zhejiang has become genetically differenti- 
ated from other cultivated populations and chemical diversity may 
also be influenced by genetic factors. Thus, this result can help 
design strategies of crop improvement that consider both genetic 
and ecological factors for the cultivation of medicinal plants. 
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